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THE THEORY OF DISCRIMINATION 


by 

Kenneth J. Arrow 

1* Introduction 

The fact that different groups of workers, be they skilled or 
Unskilled, black and white, or male and female, receive different wages, 
invites the explanation that the different groups must differ according. , 
to some characteristic valued on the market. In standard economic theory, 
we think first of all of differences in productivity. The notion of 
discrimination involves the additional concept that personal characteristics 
of the worker that are unrelated to productivity are also valued on the 
market. Such personal characteristics as race, ethnic background, and 
sex have been frequently adduced in this context. 

Discrimination in this paper is considered only as it appears on 
the market. Obviously, one can have discrimination in the same sense 
whenever decisions are made that concern other individuals, namely, that 
their personal characteristics other than those properly relevant enter 
into the decision. Deliberate racial segregation and discrimination in 
entrance to schools and colleges, deprivation of the right to vote along 
social and sexual lines, and discriminatory taxation are all examples 
of discrimination In non-market discrimination. 

It may as well be admitted that the term, "discrimination, 11 has 
value implications that can never be completely eradicated, though they 
can be sterilized for specific empirical and descriptive analyses. 
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I have spoken of personal characteristics that are "unrelated to pro¬ 
ductivity" and not "properly relevant." These terms Imply definitions 
of product and of relevancy which are themselves value judgments or at 
^ a te decisions by the scholar. The black steel worker may be thought 
of as producing blackness as well as steel, both evaluated in the market. 

,We are singling out the former as a special subject for analysis because 
somehow we think it appropriate for the steel industry to produce steel 
and not for it to produce a black or white work force. 

■ ( However, the value judgments are intrinsic only in determining which 
wage differences we regard as worth studying as an example of discrimination, 
not in the empirical or theoretical analysis of any form of discrimination 
once specified. 

In the following, I will address myself specifically to racial 
discrimination in the labor market. For the most part, the analysis 
extend with no difficulty to sexual discrimination, with some reservations 
to be noted. The other markets in which discrimination has been most 
observed, especially housing but also insurance and capital, are analyzed 
by the same general methods, but the operation of these markets has led 
to a greater degree of simple exclusion and less of price differentials. 

The basic aim here is to use as far as possible neoclassical tools in 
the analysis of discrimination. As will be seen, even though the basic 
neoclassical assumptions of utility and profit-maximization are always 
retained, many of the usual assumptions will be relaxed at one point or 
another: convexity of indifference surfaces, costless adjustment, 
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perfect information, perfect capital markets. As X will try to show, 
the abandonment of each of these assumptions is motivated by a clearly 
compelling reason in the theoretical structure of the subject. Personally, 
I*believe there are many other economic phenomena for the explanation of 
which each of these assumptions must be abandoned, so that the steps pro¬ 
posed here are not ad hoc analyses for the case at hand but should be 
important elements in a more general theory capable of analyzing the 
effects of social factors on economic behavior without either lumping them 
into an uninformative category of "imperfections" or jumping to a precipi¬ 
tate rejection of neoclassical theory with all its analytic power. 

The first application of neoclassical theory to discrimination that 
I know of is that of Edgeworth [1922], but the main study to date has 
been that of Becker [1959]. The analysis to be presented here appears 
in a more technical form in Arrow [1971]. It seeks to develop further 
Becker*s models and relate them more closely to the theory of general 
competitive equilibrium, though frequently by way of contrast rather 
than agreement. 

Since I am presenting here the theory of discrimination in the labor 
market and not the entire theory of racial differences in income, I 
abstract from differences in productivity between the groups of workers. 

In an empirical study, it will be necessary to allow for this possibility. 
In the case of blacks and whites, some possible causes of productivity 
differences have been established (differences in educational quantity 
and quality, family size, and household headed by woman; see Duncan [ ]) 



and others surmised (culturally varying attitudes toward work and 
future-orientation, derived from the heritage of slavery and other 
historical factors)» These differences themselves may be the result 
of discrimination in other areas of life. But for theoretical analysis 
of discrimination in the labor market, it is legitimate to assume that 
there are two groups of workers, to be denoted by B and W, which are 
perfect substitutes in production. 

For the simplest model, then, we have a large number of firms all 
producing the same product with the same production function. Discrimination 
means that some economic agent has some negative valuation for B or positive 
valuation for W or both, a valuation for which he is willing to pay and 
has the opportunity to pay. The agents who could possibly discriminate are 
the employer, who might sacrifice profits to reduce or eliminate B 
employment in his plant, or the W workers who might accept a lower wage to 
work in a plant with more W and less B workers. (It is also possible for 
products sold on a face-to-face basis, that customers might discriminate 
by being willing to pay higher prices to buy from whites; this case could 
be studied along similar lines but will not be dealt with here.) Not 
flll discriminatory feelings can find expression in the market; an 
entrepreneur who has a distaste for competing against firms with B workers 
has no way, within the economic system at least, of expressing his tastes 
and therefore of influencing wage levels. 

^Although my concern here is with discrimination and not with productivity 
differences, I must note my skepticism about the frequently—made argument 
that blacks have lesser future-orientation. For this disregards the well- 
known fact that of any group income level blacks save at least as much as 
whites. 



I assume that, given the tastes, the markets work smoothly. General 
equilibrium requires full employment of both B and W workers; the wages 
of both will adjust to clear the market, and the discriminatory tastes, 
will be reflected in wage differences. 

Let us just consider the simplest case, that in which the employer 
discriminates. Then he accepts a trade-off between profits, 7r, and the 
numbers of B and W employees. That is, we suppose he seeks to maximize, 
not profits, but a utility function, U (it, B, W) . We assume to get the 
simplest case, that there is only one type of labor; in the short run, 
we also take capital as given, so that output is f (VH-B), since the two 
kinds of labor are perfect substitutes (at a 1-1 ratio). If we take output 
as numer aire, then profits are given by the expression 

tt - f (W+B) -w, W-w_ B, (1) 

w o 

where w TT and w_ are the wage rates, taken as given by the employers. If 
we proceed along conventional lines, the employer equates the marginal 
productivity of each hand of labor to the price to him. But here the 
"price" of B labor is the market price, W^, plus the price the employer 
is willing to pay, in terms of profits, for reducing his B labor force by 
one. This second term is what Becker has termed the "discrimination 
coefficient," to be designated as d^; it is the negative of the marginal 
rate of substitution of profits for B labor. If, as we usually suppose, 
the marginal utility of B labor is negative, then the discrimination 
coefficient, d n , is positive. 
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dfWhere dg " - MR^, fi • Similarly, 

; “ W w +d w • 


( 2 ) 


(3) 


where is negative (or zero if the employer has no positive liking 
for having W workers). But we are assuming that the two types of labor 
are interchangeable in production, so that MP^ - MP fi ■ MP L » say. Then, 
from (2) and (3), w R + d dw - w fi + d fi , or 

! ’ M w ■ W B “ d B • S > 0 • W 

so that equilibrium requires that W wages exceed B wages, as might be 
expected. 

For the moment, assume that all firms have the same utility function, 
U (tt, B, W,). It then appears reasonable to assume that all hire the 
same amounts of B and W (but we will return to this point in the next 
section). Then each firm’s labor force is the same, and the allocation 
of labor is efficient. The effects of discrimination are purely dis¬ 
tributive. The most obvious implication then is that B workers are paid 
less than their marginal product, so that the W workers and employers 
together gain. Also, the W workers clearly gain, or at least do not lose, 
from (3), with » 0. The effect on profits, however, depends on the 
exact nature of the utility function. Under the assumption made, it 
follows from (1-3) and the fact that MP^ ■ MPg • MP^ that, 
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| Ui> , 11 “ *(!<) - <MP L )L + d w W + d B B * (5) • 
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ff where L ■ W+B, the total labor force of the firm. If there were no 

discrimination, profits would be, 

- f(L) - (IIP l )L, 

S and therefore the change in profits is simply, 
o ir - tt o - dyW + d fi B . (6) 

(: f ~ r v 

The right-hand term has a single interpretation. If we consider an 

V •' h:.;l , 

* '> increase in the firm's labor force with the proportions of W and B workers 

, tic;, v, , 

constant, then the negative of the marginal rate of substitution of 
profits for this balanced increase is simply (W/L) + dg (B/L); this 
is the firm's need for additional profits to compensate it for a balanced 
increase in size. This term may of course be positive or negative. 

However, a plausible hypothesis which we shall maintain hereafter 
is that employers'ssatisfactions depend only on the ratio of B to W workers. 
In that case, 

djj W + d fi B = 0 , (7) 

and (6) tells us that employers neither gain nor lose by their discriminatory 
behavior. The entire effect is that of a transfer from B to W workers. 

Let us now relax the assumption that utility functions are identical 
among firms. We continue to assume that for each firm, the utility 
depends only on the ratio of W to B workers, but some firms may be more 
discriminatory than others, in the sense that the marginal rate of 
substitution of profits for B workers will be more negative at any given 
ratio, B/W. Equation (4) and (7) hold for each firm, at least each firm 



;f; : that employs both types of workers. They can be regarded as a pair of 
k linear equations in W and B, to yield, 

& i -* r * 

|/ W/L - d B /(w w - w fi ); B/L - -d w /(w w - w fl > • 

v • ■ 

Since d™ > 0, if there are both B and W workers, it must be that w^ > w g» 

* B 

as before. We will observe firms with different ratios of W to L. The 

i 

firms that display the most discrimination at the margin, i.e., the 
highest values of dg, have the highest ratios of W to L. Thus an observa¬ 
tion on all the firms in existence at equilibrium will reveal a dispersion 
of W-proportions in the labor force, and these ratios will measure the 
varying degrees of discrimination. Thus a partial degree of segregation 
appears; the B workers tend to be found in the less discriminatory 
firms, the W workers in the more discriminatory ones. 

However, further analysis leads to implications which might raise 
some empirical questions. Specifically, equation (2) still holds, with 
MP_ ■ MP-, Hence, according to the model, HP- is higher for more 
discriminating firms. But then if we assume diminishing marginal pro¬ 
ductivity of labor, it follows that the less discriminatory the firm the 
larger it will be. This accords with common sense; discrimination is 
costly to the entrepreneur and acts like a tax on him, since it shifts 
his demand for labor to the more costs component. Hence it restricts 
his scale. 

Since MP is no longer the same from firm to firm, it follows that 
Li 

production is no longer efficient. The previous strong statements 
about the incidence of discrimination no longer hold exactly either. 
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However, their general thrust is still probably correct. Efficiency 
losses are not apt to be great, and the main redistribution is still 

V v £• 

likely to be from B workers to W workers. 

It has been seen that competition tends to reduce the degree of 
discrimination in the market, in the sense that the unweighted average 

.jl .X 

of discrimination coefficients of the different firms exceeds the average 
weighted in proportion to the number of workers. 

This result, which may or may not be empirically reasonable, appears 
more strongly and less likely when one pushes the analysis in the long run. 
Now we are assuming that capital, which has been hitherto held fixed, 
is adjusted optimally to the size of the labor force. Then capital will 
flow to the. more profitable enterprises which, in this context, are the 
less discriminatory. In the long run, output is therefore simply 
proportional to labor (assuming the production function displays constant 
returns to capital and labor). The marginal product of labor is then 
constant. As a result, the competitive effect just studied assumes an 
exaggerated form. Only the least discriminatory firms survive. Indeed, 
if there were any firms which did not discriminate at all, these would 
be the only one to survive the competitive struggle. Since in fact 
facial discrimination has survived for a long time, we must assume 
that the model just presented must have some limitation to which we 
will return in section 4 below. 

We have dealt extensively with the assumption of discrimination by 
employers. But, as we observed earlier, discrimination by co-workers is 



also a possibility. The most straightforward extension of the preceding 

.. •i- v.'Shl v u-i. « 

analysis is to the case of complementary services. Suppose now there are 

V wu,: .. 

two kinds of workers, say foremen and floor workers. It is the foremen 

|<Vj- ;j! 

who like working with W's and dislike working with B's. As before, we 

assume that the likes or dislikes are governed by the ratio of W to B 

On. ■ 

floor workers. Each foreman then chooses among alternative employment 
opportunities on the basis of both wages and the W/B ratio. Assume that 

SC/' K' 

all foremen have the same utility function. 

■ ‘ The equilibrium in this model is a trifle unorthodox. Instead of an 

equilibrium wage for foremen, there is an equilibrium relation between 

foremen's wages and W/B ratios in firms. Every firm must lie on this 

curve, and the equilibrium curve will be one of the foremen's ~ 

indifference curves between wages and W/B. 

Let F be the number of foremen, and w^, their wage. Then the firm 

faces fixed W tT and W_ for the floor workers and a fixed relation, 

W U 

W F “ W F^ ^ 


where L W B is the total floor force. The firm's short-run profits 


are defined by, 

w » f (L, F) - w y W - w fi B - WpF, (9) 


where it is assumed, as before, that W and B floor workers are perfect 


substitutes. 


Assume now that firms have no discriminatory tastes. They seek 
only to maximize profits. They will still not hire B workers at equal 
wages with W since an increase in W decreases the wages and therefore the 
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cost of F, while an increase In B decreases the cost of F. Hence a 

s \ 

W worker is worth more than his marginal product, while a B worker is 

<C .V .1 

worth less, exactly as in the case of employer discrimination. Further, 
the extent of the premiums over or deficits from marginal product depend 
only on the ratio of W to B. Hence, the previous analysis applies with 
suitable modifications. W workers are paid more than their marginal product, 
B workeres less. If all firms wind up with the same levels of W and B, 
then the results are entirely parallel to those for employer discrimination: 
productions remains efficient, and the entire incidence .of the foremen's 
discrimination falls negatively on the B workers and postlvely to an equal 
extent on the W workers. 

As in the case of employer discrimination, the extent of the wage 
difference between B and W workers depends on the extent of discrimination. 
The precise formula is of some interest. Recall that, by (8) w_, is a 
function of the ratio, W/L. By w'p, I will mean the derivative of Wp 

l • 

with respect to this ratio (this is negative). Then w' /w_ is the 
proportional rate of change of the demanded wage rate (along the 
equilibrium indifference curve between foremen's wages and W proportion 
in the floor force) and therefore is a measure of discriminatory tastes. 

Let Sp be total payments to foremen, total payments to floor workers. 

Then the following has been shown in Arrow [1971, Technical Note B]. 

"w - W B _ fF 

“'l w f s l 


( 10 ) 
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The left-hand side is the market wage differential due to dis¬ 
criminatory tastes of foremen relative to the wage level in the absence 
of discrimination. 

There is an interesting aspect of this formula. Given the degree of 
discrimination as measured by -w'p/wp, the observed wage differential 
depends on the ratio S^/S^. That is, the more important the share of 
foremen in the output of the firm relative to floor laborers, the 
greater the wage differential. 

The language of the preceding analysis has assumed that it is the 
foremen or other supervisory employees who discriminate according to 
the composition of the floor workers. But the analysis itself is com¬ 
pletely abstract. It may be illuminating to reverse the roles. Suppose 
that production workers have strong discriminatory feelings about their 
supervisors. Certainly the idea that white workers strongly resent 
being bossed by black supervisors or male workers by female foremen 
(foreladies? forepersons?) is a common one. Then if in (10) we 
understand by W and B those kinds of supervisory workers, by the total 
number of such workers, and by F the floor workers, we have an excellent 
explanation of discrimination against B supervisory workers, for S^, 
then would be Very large indeed compared with S^. 

Foremen may possibly differ in their tastes for discrimination. One 
might suppose that this will lead to a reduction in market wage differentials, 
analogous to the situation with employmr discrimination. But a fuller 
analysis of this case remains to be done. 
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2• s Nonconvexities in Indifference Surfaces and Opportunities 

I have gradually become convinced that the usual assumptions that 
indifference surfaces are convex is inapplicable to the case of racial 
discrimination and indeed to many other problems in the economics of 
externalities. Pollution provides another example; Starrett [1971] has 
already pointed to the importance of nonconvexity in this context. 
Assumptions which seem very reasonable in the contexts of discriminatory 
behavior necessarily imply a nonconvexity of the indifference surfaces 
of the firms in the case of employer discrimination or of the firm’s 
profit function in the case of discrimination by complementary workers. 

• Actually, my view is that the nonconvexity of indifference surfaces 
is in fact a widespread phenomenon. An excellent example in commodities 
with no externalities is residential location. One could after all live 
half the time in one place and half in the other. Convexity implies that 
such an arrangement would be at least as good as the least preferred of 
the two locations. If one is indifferent to the two, then he will prefer 
the mixture. In fact, taken literally, convexity would imply that 
individuals would be willing to spend half of any minute in one place and 
half in the other. But (except for a few ’’beautiful people"), most 
individuals find it preferable to live in one place, even though there 
may be another to which they are indifferent. 

Indeed, if one looks through the literature, it is hard to find a 
convincing intuitive explanation of convexity of indifference surfaces. 

The best argument is that convexity is a necessary and sufficient condition 
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for'the continuity of demand functions. But this argument applies only 
to individual demand functions. Since each individual is small on the 
scale of the entire market, even the largest discontinuity in an individual 
demand function implies a negllgable discontinuity in .the market . demand 
function.. Hence, observations which suggest approximate continuity in 

Hi *.- 3 \ t 

market demand functions in no way imply convexity of indifference surfaces. 

.■ V r v 

In particular, the existence of general competitive equilibrium remains 
unaffected, or, to be precise, the existence of an approximate equilibrium 
of supply and'demand on all markets can be demonstrated. (This line of 
argument was suggested initially by Farrell [1959] and subsequently 
developed by Bator [ ], Rothenberg [ ], Auraann [1964], and Starr [ ] 

for one exposition, see Arrow and Hahn [1971, Chapter 7]. 

It is true that the market demand function, if it is effectively 
continuous, can be derived by adding up a new set of Individual demand 
functions, each derived from a "convexified" indifference map obtained 
from the original by filling in all the holes in the indifference surfaces. 
From the point of view of prices and total market quantities, the newly 
formed indifference map predicts as well as the original does, and 
therefore one might be tempted to assume that one could act "as if" 
indifference surfaces were convex, though with some flat surfaces. But 
there is a loss of information, for the distribution of goods among 
individuals is quite different from what it would be if all individuals 
had convex indifference surfaces. Thus, in our residential location 
example, the market totals (how many people-hours are spent in each 
place) and the rents in the two places are well predicted by the convex 
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approximation . But recognizing the underlying nonconvexities enable 
us to predict that half the people will be in one place all the time 
and half in the other, instead of each individual's spending half his 

jj 

timein one place and half in the other. 

Let me give a brief diagrammatic illustration. Suppose every 
Individual has the same indifference map, as given by Figure 1, and 

i 

the same initial endowment, represented by A. One's initial reaction, 

conditioned by years of working with convex indifference maps, is 

to assume that there is no trade; since all individuals are alike in 

every economic respect, they should wind up alike, which in this case 

means each with his own initial bundle. But this is clearly false. In 

fact the equilibrium can be obtained as follows: convexlfy each 

indifference curve by filling in the hole with a straight line segment 

tangent to the curve at both ends, as, for example, the segment BC on 

curve I q . Now we see that if we pretend for the moment that the 

convexified map is the true indifference map for each individual, then 

each individual winds up on the convexified curve I . Since this 

o 

curve is flat at the point A, the price ratio is determined by the slope 
of BC. Now return to the individual, who has the original indifference 
curve I q . At these prices, he-will maximize utility at two different 
points, B and C, but not at any point in between. If, for example, A 
is half-way between B and C, the:, market equilibrium is realized by 
having half the individuals at B and half at C. If A is two-thirds 
of the way from B to C, the market equilibrium is realized by having two- 
thirds of the individuals buy the bundle C and one-third the bundle B. 
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Note that each Individual is at a point of maximum utility for him 
subject to his budget constraint, so that this is truly a competitive 
equilibrium and therefore efficient. (The earlier reference to 
"approximate equilibrium" is relevant when there are not enough individuals 
to split them in the right proportions between B and C. Thus if A is .71 
of the way from B to C there are only 50 individuals in the economy, 
there should be 35-1/2 individuals at C and 14-1/2 at B. Thus, at C or 
B the discrepancy between supply and demand cannot be reduced below half 
an individual. This is relatively a minor discrepancy between supply 
and demand.) 

Thus non-convexity implies the existence of distinct niches for 
economic agents in a sense of the word which I take to be close to that 
used in ecology. One observes agents, identical in their economic data, 
engaged in diverse consumption patterns or other economic activities. Any 
given agent may be indifferent between several of these niches, but 
equilibrium requires their coexistence. This argument underlies Adam 
Smith's discussion of specialization, as opposed to Ricardo's which was 
based on differences in the productivities of individuals or nations; it 
has been made explicit in an important but neglected paper of Houthakker 
[ 1 . 

Let me now apply these abstract concepts to racial discrimination. We 
take up a model due to Becker [1959, Chapter 4] and in a different form 
to Welch [1967] and not analyzed above. Now we locate the discriminatory 
tastes in the W workers who are perfect substitutes for the B workers. 



To keep matters as simple as possible, assume there is only one kind of 
labor. Then, analogous to the assumption made about complementary forms 
of labor, we now assume that W workers have an indifference map r between 
wages and the proportion W, so that at equilibrium, there is a relation, 

V; 

w w - w w (W/L), (ID 

where w., decreases as W/L increases from 0 to 1. As part of profit 
maximization, the firm will certainly seek that combination of W and B 
which will minimize the cost of hiring whatever total number of workers, 

W + B * L, it does hire. This cost is 

C(W, B) - w w (W/L) W + w fi B. (12) 

But it is easy to see that a firm will always achieve minimum cost 
with either an all-W or an all-B labor force. The two might be equally 
cheap, but certainly any combination with W and B both positive will be 
more costly than at least one extreme case and possibly more costly than 
both. To see this, consider two cases: 

(a) w tt (l) • w D : Recall that W/L « 1 means an all-W labor force. 

Then for any W, 0 < W < L, w w (W/L) > w w (l) » w 0 , and therefore w w (W/L)W + 

w„B > w_(W+B) « w„L; hence, an all-B labor force, with cost w_L, is 
B B B ° 

cheaper than the mixture. An all-W labor force has a cost w w (l)L and then 

w t .(l)L » w^L, so the all-W labor force is at any rate no less costly; 

W B 

if w„(l) - w_, the two extreme cases are equally cheap. 

W B 

(b) w w (l) < w fi : Then if W <L, w w (W/L) W + w fi B > w w (l)Wtv B B > 

w„(l)(WfB) « w^d) L; the all-W labor force is cheaper than any other 
w w 


labor force. 




Hence, if w w (l) < w fi , every firm will find it cheapest to select 

all~B labor force, and if w^(l) > w^, every firm will minimize cost 

by hiring an all-W labor force. But equilibrium requires full employment 

of both types of workers. The equilibrium then requires w TT (l) " w D . 

But even then no firm will hire both W and B workers. We find, then, that 

at equilibrium every firm is segregated, but then the only observed wage 

for W workers is for those in all-W firms, i!*e., w TT (l), which is equal to 

w 

W B* Therefore, discrimination by W workers will not result in market 

■ I 

wage differentials but instead does result in segregation. 

In technical terms, the function, C(W,B) is not a convex function, 
specifically, the isocost curves in W-B space are not concave to the origin. 
Convexity implies a tendency to the middle, to compromise; but here we 
have a rushing to extremes. We also have the characteristic implication 
of nonconvexity, a dispersion of firms with basically Identical market 
opportunities into discrete niches. 

Now, in going back over the analyses of section 2, it can be observed 
that the case just discussed, of discriminatory tastes by a perfect 
substitute group of workers, is strikingly similar to that of discriminatory 
tastes by workers of a complementary type, the "foremen" of our example. 
Though a detailed analysis of the non-convexities in this case has not 
yet been made, it is clear that the profit function defined by (9) is not 
in general a concave function. Rather, the surface defined by profits as 
a function of W and B has holes scattered through it. Hence, it is at 
least possible that for certain values of w^ and w fi and some equilibrium 
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relation* W-.CW/L)* there are several distinct points of maximum profits, 
r 

Equilibrium on the three labor markets (W, B, and F) may be achieved 
by different amounts of these, even though each firm has the same 
production function and each faces the same wages for W and B and the 
same relation between w p and W/L. Thus, there will be a partial 
segregation by firms. 

1 i The relation (10) stated earlier still holds for each firm, so the 
previous conclusions remain valid. 

We can now reconsider the theory of employer discrimination. The 

utility function, U(rr, B, W), depends, it has been assumed, only on the 

ratio B/W. But it is shown in the Appendix that such a utility function 

2 

cannot possibly'have convex Indifference surfaces everywhere. Therefore 
it is possible and in fact likely that in the short run equilibrium will 
require the coexistence of firms of different sizes with different W/B 
ratios* even if all firms have same utility function. Thus at least 
partial segregation is a likely outcome of the utility-maximization 
theory. All the firms will have to have the same utility, so that the 
larger firms will be those with the larger proportions of Wuwbrkers 
since utility increases with x and with W/B. 

In the long run, indeed, it can be seen that with constant returns 
to scale, there must be perfect segregation and equality of wages. 

2 

The nonconvexity that arises when only ratios matter is of importance 
in the theory of pollution also. For the characteristic situation 
there is that the pollutee is faced with consuming air or water in which 
the proportion of pollutants is given to him. 



For suppose there is not perfect segregation, i.e., there is at least 



Since HP, is constant in the long run, it follows from the preceding 

section that all B workers will be in firms with the smallest dg and all 

W workers in firms with the (algebraically) smallest d^. Then all 

firms have the same dg and same d^ and therefore all have the same W/B 

ratio. The equilibrium values of w^ and Wg will be MP^ - d^ and MP^ - d fi 

respectively, where MP is the long-run marginal product of labor, 

L 

a constant. But then any firm which increases its B/W ratio slightly 
can make positive profits; by increasing its scale, it can make 
indefinitely large profits with only a slightly altered W/B ratio. 

It would therefore have a higher utility, so we have a contradiction 
to the existence of equilibrium with at least one integrated firm. 

Hence, all firms are segregated. In an all-B firm, d Q * 0, for an 
increase in B does not change the W/B ratio and therefore leaves utility 
unchanged. Similarly, in an all-W firm, d^ “ 0. It follows that, as 
in the case of discrimination by substitutes, the long run equilibrium 
is one of perfect segregation and equal wages. 

The corresponding analysis for discrimination by complementary 
employees has not been worked out. It can be conjectured, though, that 
segregation plus possibly competition among foremen with varying 
discriminatory tastes will greatly weaken wage differentials. 
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3. Costs of Adjustment 

V , •:' 

1 Utility-maximization theories then provide a coherent and by no 
means unreasonable account of the effect of discriminatory tastes on 
the market in the short run. Yet they become unsatisfactory in the 
long run. I propose as a possible explanation that long run adjustment 
processes do not work as perfectly as they are usually assumed to. 

When there are significant nonconvexities, the adjustment processes 
called for must be very rapid indeed; marginal adjustments are punished, 
not rewarded. In the case of discrimination by substitute workers, the 
firm would have to be willing to fire its entire all-W labor force and 
replace it by an all-B labor force or vice versa in response to a very 
small change in wages. It is not unreasonable to assume that there are 
costs, not ordinarily taken account of, which will restrain the firm 
from being quite so free in its adjustment behavior. 

Now the idea that adjustment is costly is one that has appeared in 
several diverse areas of economics. The costs of growth enter explicitly 
in some versions of the dynamic theory of the firm (Penrose [1959]; 

Marris [1964]). That is, the growth of the firm imposes a cost, which 
depends on the rate of growth and which is additional to the purchase 
of capital goods. 

The same principle, that capital costs of an unconventional kind 
play an important role in economic behavior and decisions, has been 
applied to the study of labor turnover, a problem more closely connected 
with outs. Operations researchers, in trying to draw up plans for the 
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hiring of personnel, have incorporated in their models a fixed cost of 

i" ' ' ’ 

hiring an individual. Sometimes it is also held that there is a cost 
attached to firing as well. These costs are partly in administration, 

£v - 

partly in training. Even in the case of workers who have already been 
generally trained in the kind of work to be done, there is a need for 
learning the ways of the particular firm. This approach, it has been 
argued by some, has important general economic implications; it implies 
that firms should not adjust their labor force to cyclical shifts in 
demand, since they may incur both hiring and firing costs if they do, 
costs that are avoided if the worker is retained during slack periods. 
Workers are being held in employment even though they contribute little 
to output to avoid the costs of rehiring them in the expected future 
boom, I do not know myself whether this explanation is in fact adequate 

r 

but merely note that it is seriously considered. 

I suggest that a similar consideration explains why the adjustments 
which would wipe out racial wage differentials do not occur or at least 
are greatly retarded. We have only to assume that the employer makes 
an investment, let us call it a personnel investment , every time a worker 
is hired, He makes this investment with the expectation of making a 
competitive return on it; if he himself has no discriminatory feelings, 
the wage rate in full equilibrium will equal the marginal product of 
labor less the return on the personnel investment. Let us consider the 
simplest of the above models, that of discrimination by fellow employees 
who are perfect substitutes. If the firms starts with an all-W labor 



force, it will not find it profitable to fire that force, in which its 
personnel capital has already been sunk, and hire an all-B force in 
which a new investment has to be made simply because B wages are now 
slightly less than W wages. Of course, if the wage difference is large 
enough, it does pay to make the shift. 

Obviously, in a situation like this, where there are costs to change, 
history matters a good deal. A fully dynamic analysis appears to be very 
difficult, but some insight can be obtained by study of a very special 
case. I here present only the results; the argument will be found in 
Arrow [1971, Technical Note E]. Suppose initially there are no B workers 
in the labor force. Then some enter; at the same time, there is an 
additional entry of W workers, and some new equilibrium emerges. Under 
the kinds of assumptions we have been making, a change, if it occurs 
at all, must be an extreme change, but there are now three kinds of 
extremes, or corner maxima. The typical firm may remain segregated 
W, though possibly adding more W workers, it may switch entirely to a 
segregated B state, or it may find it best to keep its present W workers 
while adding B workers. In the last case, of course, it will have to 
increase the wages of the W workers to compensate for their feelings of 
dislike; but it may still find it profitable to do so because replacing the 
existing * W workers by B workers means wasting a personnel investment. 

If we stick closely to the model with all of its artificial conditions, 
we note that only the all-W firms are absorbing the additional supply of 
W workers, so that there must be some of those in the new equilibrium 
situation. On the other hand, there must be some firms that are all-B 





or else some Integrated firms whose new workers are B’s in order to 

absorb the new B workers. It can be concluded in either case, however, 

1 1 ; 

that there will always remain a wage difference between B and W workers 
in this model. Further, there will be some segregated W firms, Whether 
the remaining firms will be segregated B or integrated depends on the 

v i : ? 

degree of discriminatory feelings by W workers against mixing with B 

lsu. •< . 

workers. 

I have not worked out the corresponding analysis for the case where 

there are several types of workers with different degrees of discriminatory 

feelings against racial mixtures in the complementary types. Nevertheless, 

one surmises easily that similar conditions will prevail. 

The generalization that may be hazarded on the basis of the discussion 

thus far can be stated as follows. If we start from a position where 

B workers enter an essentially all-W world, the discriminatory feelings 

by employers and by employees, both of the same and of complementary 

types, will lead a difference in wages. The forces of competition and 

the tendency to profit-maximization operate to mitigate these differences. 

However, the basic fact of a personnel investment prevents these 

counteracting tendencies from working with full force. In the end, we 

3 

remain with wage differences coupled with tendencies to segregation. 


3 

The preceding five paragraphs have been quoted, with minor alterations, 
from Arrow [1971, pp. 18-20]. 
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4• Imperfect Information 

There is an alternative interpretation of employer discrimination. 

It can be thought of as reflecting, not tastes, but perception of 

» ti, V *S ■ ' ' 

reality. That is, if employers have preconceived ideas that B workers 

v? f 

have lower productivity than W workers, they may be expected to be 
willing to hire them only at lower wages. One must examine in detail 
the conditions under which this argument is possible to maintain, that 
is, the conditions under which the effects of these preconceptions are 
the same as those of discrimination in the strict sense of tastes. 

First of all, the employer must be able to distinguish W workers 

from B workers. More precisely, the cost of making the distinction 

c . >, 

should be reasonably low. An employer might derive from his reading 
the opinion that an employee with an unresolved Oedipus complex will be 
disloyal to him as a father-substitute; but if the only way of determining 
the existence of an unresolved Oedipus complex is a psychoanalysis of 

4 , - . 

several years at the usual rates, he may well decide that it is not worth 
while for him to use this as a basis for hiring. Skin color and sex are 
cheap sources of information. Therefore prejudices (in the literal sense 
of pre-judgments, judgments made in advance of the evidence) about such 

:.i -V '■ 

differentia can be easily implemented. School diplomas undoubtedly play 
an excessive role in employer decisions for much the same reason. 

Second, it must be that the employer must incur some cost before he 
can determine the employee’s true productivity. If it could be determined 
costlessly, there would be no reason to use surrogate information, 
necessarily less valid even under the most favorable conditions. 



I suppose, therefore, that the employer must hire the employee first and 
thep incur a personnel investment cost, as discussed in the last section, 
before he can determine the worker's productivity. This personnel 
investment might, for example. Include a period of training, only after 
which is it possible to ascertain the worker's productivity; or indeed 
it may be only a period of observation long enough for reliable determination 
of productivity. In the absence of a personnel investment cost, after all, 
the employer could simply hire everyone who applied and fire those 
unqualified, or pay them according to productivity. 

Third, it must be assumed that the employer has some idea or at any 
rate preconception of the distribution of productivity within each of 
the two categories of workers. 

The simplest model to bring out the implication of these assumptions 
seems to be the following. Suppose there are two kinds of jobs, comple¬ 
mentary to each other, say unskilled and skilled. All workers are qualified 
to perform unskilled jobs, and this is known to all employers. Only some 
workers, however, are qualified to hold skilled jobs. The employers need 
make no personnel investment in hiring unskilled workers but must make 
such an investment for skilled workers. The employer cannot know of any 
given worker whether or not he is qualified; however, he does believe that 
the probability that a random W worker is qualified is p w and that a 
random B worker is qualified is p^. An employer will eventually know 
whether or not a worker hired for a skilled position is in fact qualified, 
but this information is not available to other employers. He thus can 
count on keeping the qualified workers he hires. 
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duiLet r be the necessary return per worker on the personnel investment 
for skilled jobs. If a W worker is hired, then with probability p w he 
is qualified; his productivity is MP g , the marginal productivity of 
skilled workers, but the employer must pay a wage, w y , so that the net 
gain to the employer is MP g - w w » On the other hand, if the worker hired 
turns out to be unqualified, the employer receives nothing. Hence, the 
expected return to a W worker hired is (HP g “ p^. If the employer 

is risk-neutral, this must be equal to r. Similarly, 

r - (MPg - w B > P B , (13) 

and therefore, 

v w w - q w fi + (1-q) MP g , 

where q - P B /0y * Thus, if, for any reason, p fi < P w » w y is a weighted 
average of w_ and MP C and therefore lies between them; since from (13) 
we must have w n < MP e ((in order that the employer recoup his personnel 
investment), it follows that w^ > Wg, i*e», the effect of the differential 
judgment as to the probability of being qualified is reflected in a wage 
differential. 

If there are price rigidities which prevent w fl from falling much 

below w , the same forces may be reflected in a refusal to hire B workers 
W 

at all for skilled jobs. 

Once we shift the explanation of discriminatory behavior from un¬ 
analysable (or at any rate unanalyzed) tastes to beliefs, we are led to 
seek to explain these beliefs. One possible explanation runs in terms of 
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theories of psychological equilibrium, of which Festinger*8 theory of 
cognitive dissonance [1957] is one of the most developed. The argument 

J-Wi.f . 1 r, ; 

la that beliefs and actions should come into some sort of equilibrium; 

llK: i t ' 

in particular, if individuals act in a discriminatory manner, they will 

V i V, . ■ 

tend to acquire or develop beliefs which Justify such actions. Hence 
discriminatory behavior and beliefs in differential abilities will tend 

|v * ... 

to come into equilibrium. Indeed, the very fact that there are strong 
ethical beliefs which are in conflict with discriminatory behavior will, 
according to this theory, make the employer even more willing to accept 
subjective probabilities which will supply an appropriate justification 
for his conduct. 

Finally, one can also seek explanations in which p^ and p^ differ 
in reality, even though the intiinsic abilities of W and B workers are 
identical. Such an explanation requires some further assumptions. 
Specifically, whether or not a worker is qualified is now taken to be 
the result of a decision by him, rather than some type of intrinsic 
ability. More specifically, a worker becomes qualified by making some 
type of Investment in himself. In accordance with the previous assumptions, 
this investment must not be observable by the employer. Hence, the 
investments are not the usual types of education or experience, which 
are observable, but more subtle types of personal deprivation and deferment 
of gratification which lead to the habits of action and thoughc that favor 
good performance in skilled jobs, steadiness, punctuality, responsiveness 


and initiative. 
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Flnally, it must be assumed, as is reasonable, that the human capital 
needed to qualify cannot be acquired on a perfect capital market. It 
follows that the proportion of either group (W or B) who qualify is an 
increasing function of the gain from qualifying. In accordance with our 
basic assumption that there is no intrinsic productivity difference 
between W and B workers, we assume that the supply schedules for the two 
groups are the same. Specifically, let " w y ~ Wy be the gain to a 
W worker from qualifying, where Wy is the wage rate for unskilled labor, 
and similarly let v n « w_ - w TT . Then we postulate an increasing function, 
S(v), such that 


P W “ S( V’ P B “ S( V 


(15) 


Let MPy be the marginal productivity of unskilled labor, so that, 


HP * w . 
U U 


(16) 


Note that MP„ and MP„ are determined by the supplies of skilled and 
unskilled labor and these in turn are determined by the proportions p y 
and p_. Hence the system consisting of the equations (13), (15) and (16) 
plus the equation obtained from (13) by replacing B by W constitute a 
system of equations in the unknowns w^, Wg» Wy, p^, and py. 

From the symmetric formulation of the system, it Is clear that there 
is a symmetric equilibrium, i.e., an equilibrium in which p^ ■ Pg and 
w^ * Wg, However, it remains an open question whether this is the only 
equilibrium. It has been possible however to ask whether or not the 
summetric equilibrium is stable. The answer turns out to be that it 
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depends on the parameters of the problem; It Is certainly possible that 
phi# equilibrium be unstable, a result which strongly suggests, though 
it does not prove, that there are equilibria other than the symmetric, 
non-discriminatory, one. 

Intuitively, consider a possible sequence of events, in which 
initially p^ slightly exceeds p fi for some reason. Then w^ slightly 
exceeds w^ and therefore, from (15), p^ tends to rise relative to Pg, 
therefore reinforcing the original disequilibrium. This verbal argument 
is certainly not conclusive nor very convincing. It is necessary to 
specify the dynamic model more precisely and then calculate the stability 
conditions. We suppose that for given p w and p 0 , short-run equilibrium 
works Itself out so quickly as to be instantaneous. The basic dynamics 
then are Marshallian; that is, p w adjusts itself over time to the desired 
level, S(Vy), and similarly with pg. In symbols, 

dp w /dt » k[S(v w ) - p w 3 , 

and a similar equation for p^. This gives a pair of differential equations. 

O 

We can study their stability in the neighborhood of the non-discriminatory 
equilibrium. While the algebra involved is elementary enough, there seems 
no way of making the result intuitively obvious. Hence, we simply 
reproduce the stability condition here, referring the reader for proof 
to Arrow [1971, Technical Note F]. 

Let w„ be the common value of w TT and w„ at the non-discriminatory 

b W D 

equilibrium, p the common value of p w and p fi . Then v * w g - w y is the 
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common value of v TT and v„. Let E be the elasticity of S(v) with respect 

W D 

to v, computed at the symmetric equilibrium value of v. From (13), 

MP - w c ■ r/p at the symmetric equilibrium; it is the excess of marginal 
product over wages for skilled workers. Then the condition for stability 
turns out to be that, 

E(MPg - Wg)/(w s - Wy) < 1 . 

As might be expected, the greater the elasticity of the supply 
schedule for qualified labor, the more likely is the system to be unstable. 
Similarly, the greater the difference between marginal product and wage 
for skilled workers, the more likly is instability; this difference would 
be zero if there were no personnel investment costs for skilled workers, 
and then the system would certainly be stable. Finally, and less 
intuitively, the larger the wage gap between the two types of labor, the 
less likely is instability. 

I believe these results are only the barest fragment of what could be 
found with better and more detailed systems in which there is an inter¬ 
action between reality and perceptions of it. One must consider still 
more precisely how individual employers acquire knowledge which will 
modify their initial estimates of distributions as differing between 
groups and in turn the effects of these perceptions on the market and 
therefore on any incentives to modify those abilities. 
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APPENDIX 


Nonconvexity of Indiffefenee Maps Depending on Ratios 

We suppose that employer discrimination is determined by a utility 
function* U(it, B, W), where multiplying both B and W by the same 
positive constant leaves utility unchanged. We also assume that U is 
an increasing function of profits, x. It will be shown that the 
indifference map defined by U cannot have convex indifference surfaces; 

specifically, a convex combination of two indifferent points is not 

* . 

everywhere at least as good as either. 

Choose any point (it , B , W ). Then choose rr, < tt (as close as 

' 1 O O O X O 

needed) and B^, W^ so that 

UC^, Wj) =■ U(ir o> B o , W q ). (1) 

From the assumptions made, (1) will continue to hold if B^ and are 
reduced in the same proportion. Hence, B^ and can be chosen 
arbitrarily small. 

If the indifference map defined by U has everywhere convex indifference 

surfaces, then the average of the two points must be at least as good 

as (tt , B , W ) , That is, 
o o o 

U(TT', B', W') = U(» Q , B o , W o ), 

where n' - 1/2*. + l/2n , B' - 1/2B + 1/2B . V - 1/2W, + 1/2W. 

1 O X O X o 

But then, since, 

U(tt’, 2B', 2W') » U(tt', B', W'), 

U(tt‘, 2B', 2W*) - U(tt q , B q , W q ) , 


we have 



or, by definition, 

B i + B o* w i + v - u( v v v- 


tttrhu 

v*- 


But and can be chosen as small as desired. Let them approach 
0} by continuity, 


U(w\ B o , W o ) - U(n Q m B q> W q ) , 


which is a contradiction to the assumption that U is increasing in 


7T * < IT . 

O 


i 


it, since 



